Efficient Reinforcement Learning in Continuous Environments
نویسنده
چکیده
Reinforcement Learning (RL) is a machine learning paradigm with which autonomous agents can improve their behavior in unknown environments based on their own experience without an explicit teacher signal. RL algorithms are based on estimating a value function over the state space, and scaling them to large state spaces remains a challenge. One approach, known as variable resolution, is to focus representational power in regions of the state space where experience shows it is most needed. One of the most promising variable resolution methods is partigame, which has competitive performance and has shown promising scalability on the class of deterministic, continuous, goal-type RL problems. It performs dynamic, kd-tree-based partitioning of the state space based on a game-theoretic approach to assigning costs to partitions, and it uses an a priori local controller to try to navigate through the partitions it creates to reach the goal. Despite its promise, however, parti-game has a number of shortcomings relating to efficiency, consistency, sub-optimality of solutions found and reliance on a designer-supplied local controller. This work introduces a family of variable resolution algorithms, in the same spirit of parti-game, that addresses each of these drawbacks, thus providing a powerful, a-priori -model-independent paradigm for finding higher quality solutions of
منابع مشابه
Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs
We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of finding solution trajectories for such problems can be reduced by incorporating limited prior knowledge of the approximative local system dynamics. The presented algorithm builds an adaptive state graph of sample points within...
متن کاملEfficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis
We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated actions in the continuous state setting. This novel extension is appropriate for environments with continuous or finely discretized states where generalization is required, as is the case for data analysis of randomized...
متن کاملA Self-organizing Multi-agent System for Adaptive Continuous Unsupervised Learning in Complex Uncertain Environments
Introduction. Continuous learning and online decisionmaking in complex dynamic environments under conditions of uncertainty and limited computational recourses represent one of the most challenging problems for developing robust intelligent systems. The existing task of unsupervised clustering in statistical learning requires the maximizing (or minimizing) of a certain similarity-based objectiv...
متن کاملReinforcement Learning In Real-Time Strategy Games
We consider the problem of effective and automated decisionmaking in modern real-time strategy (RTS) games through the use of reinforcement learning techniques. RTS games constitute environments with large, high-dimensional and continuous state and action spaces with temporally-extended actions. To operate under such environments we propose Exlos, a stable, model-based MonteCarlo method. Contra...
متن کاملSample Efficient Actor-Critic with Experience Replay
This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stocha...
متن کاملEfficient Model-based Exploration in Continuous State-space Environments
OF THE DISSERTATION Efficient Model-based Exploration in Continuous State-space Environments by Ali Nouri Dissertation Director: Michael L. Littman The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the environment for the purpose of better decision making. As such, exploration plays a crucial role in the efficiency of RL algorithms. In this dissertation,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001